Type and token bigram frequencies for two-through nine-letter words and the prediction of anagram difficulty.
نویسندگان
چکیده
Recent research on anagram solution has produced two original findings. First, it has shown that a new bigram frequency measure called top rank, which is based on a comparison of summed bigram frequencies, is an important predictor of anagram difficulty. Second, it has suggested that the measures from a type count are better than token measures at predicting anagram difficulty. Testing these hypotheses has been difficult because the computation of the bigram statistics is difficult. We present a program that calculates bigram measures for two-to nine-letter words. We then show how the program can be used to compare the contribution of top rank and other bigram frequency measures derived from both a token and a type count. Contrary to previous research, we report that type measures are not better at predicting anagram solution times and that top rank is not the best predictor of anagram difficulty. Lastly we use this program to show that type bigram frequencies are not as good as token bigram frequencies at predicting word identification reaction time.
منابع مشابه
Citation for Published Item: Use Policy
Six previous studies of the variables affecting anagram solution are reexamined for the evidence that number of syllables contributes to solution difficulty. It was shown that the number of syllables in a solution word was confounded with imagery for one study and with digram frequency for another. More importantly it was shown that the number of syllables has a large effect on anagram solution...
متن کاملThe role of syllables in anagram solution: a Rasch analysis.
Anagrams are frequently used by experimental psychologists interested in how the mental lexicon is organized. Until very recently, research has overlooked the importance of syllable structure in solving anagrams and assumed that solution difficulty was mainly due to frequency factors (e.g., bigram statistics). The present study uses Rasch analysis to demonstrate that the number of syllables is ...
متن کاملEstimating the Parameters for Linking Unstandardized References with the Matrix Comparator
This paper discusses recent research on methods for estimating configuration parameters for the Matrix Comparator used for linking unstandardized or heterogeneously standardized references. The matrix comparator computes the aggregate similarity between the tokens (words) in a pair of references. The two most critical parameters for the matrix comparator for obtaining the best linking results a...
متن کاملCase-sensitive letter and bigram frequency counts from large-scale English corpora.
We tabulated upper- and lowercase letter frequency using several large-scale English corpora (approximately 183 million words in total). The results indicate that the relative frequencies for upper- and lowercase letters are not equivalent. We report a letter-naming experiment in which uppercase frequency predicted response time to uppercase letters better than did lowercase frequency. Tables o...
متن کاملDefying the stimulus : acquisition of complex onsets in Polish
1.1 Token frequency For completeness, this section demonstrates that relying on token frequency instead of type frequency does not provide a way out for the lexicalist hypothesis. Figure 7 shows the association between accuracy and tokenfrequency measures, analogous to the figures above for the type-frequency measures. The results are similar, albeit less promising: token segmental bigram frequ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Behavior research methods
دوره 43 2 شماره
صفحات -
تاریخ انتشار 2011